Abstract: Cloud computing is developing as a new standard for deploying, organizing, and accessing large scale distributed computing applications over the network. In cloud computing, fault tolerance is a major problem and one of the metric which consider being most important since the resource failure affects job execution, throughput, response time and performance of system and network. Fault tolerance in load balancing is one of the main challenges in cloud computing, which is required to distribute the workload equally across all the nodes, detect the fault and remove fault from the network and share workload to all the nodes to increase the performance of cloud network. The load is an amount of work that a computation system performs, which can be classified as network load, storage capacity, memory capacity and CPU load. This paper describes a survey on fault tolerance, fault tolerance techniques, load balancing algorithm and load balancing schemes including fault tolerance in a cloud environment.

Keywords: Cloud computing, Load Balancing, Fault Tolerance, Load balancing, Static load balancing, Dynamic load balancing algorithm.